Automated evaluation of non-native English pronunciation quality: combining knowledge- and data-driven features at multiple time scales

Authors

  • Matthew Black
  • Daniel Bone
  • Z.-I. Skordilis
  • Rahul Gupta
  • Wei Xia
  • Pavlos Papadopoulos
  • Sandeep Nallan Chakravarthula
  • Bo Xiao
  • Maarten Van Segbroeck
  • Jangwon Kim
  • Panayiotis G. Georgiou
  • Shrikanth S. Narayanan
Abstract

Automatically evaluating pronunciation quality of non-native speech has seen tremendous success in both research and commercial settings, with applications in L2 learning. In this paper, submitted for the INTERSPEECH 2015 Degree of Nativeness Sub-Challenge, this problem is posed under a challenging cross-corpora setting using speech data drawn from multiple speakers from a variety of native-language (L1) backgrounds reading different English sentences. Since the perception of non-nativeness is realized at the segmental and suprasegmental linguistic levels, we explore a number of acoustic cues at multiple time scales. We experiment with both data-driven and knowledge-inspired features that capture degree of nativeness from pauses in speech, speaking rate, rhythm/stress, and goodness of phone pronunciation. One promising finding is that highly accurate automated assessment can be attained using a small, diverse set of intuitive and interpretable features. Performance is further boosted by smoothing scores across utterances from the same speaker; our best system significantly outperforms the challenge baseline.

Similar resources

On the Efficacy of a Communicative Framework in Teaching English Phonological Features Absent in Persian to Iranian EFL Learners

Although Persian and English share many common phonemes, there are some phonological features that are present in English but absent in Persian which tend to lead to mispronunciation on the part of Persian learners of English, mostly through negative transfer. The present research assesses the efficacy of a communicative framework in improving Iranian adult EFL learners’ pronunciation of five E...

New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences

We have previously proposed a statistical method for estimating the pronunciation proficiency and intelligibility of presentations made in English by non-native speakers. To investigate the relationship between various acoustic measures and the pronunciation score and intelligibility, we statistically analyzed the speaker’s actual utterances to find combinations of acoustic features with a high...

Robust Recognition and Assessment of Non-native Speech Variability

Motivated by a desire to assess speaking and reading skills and perform unsupervised tutoring of non-native speakers in a foreign language, robust evaluation of speech variability and pronunciation quality must incorporate perceptually meaningful information from many domains of speech analysis – spectral and prosodic, segmental and suprasegmental, and so on. In this paper we present three tech...

Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels

This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddison’s (1996) F1 F2 measurements for American English v...

Improved pronunciation features for construct-driven assessment of non-native spontaneous speech

This paper describes research on automatic assessment of the pronunciation quality of spontaneous non-native adult speech. Since the speaking content is not known prior to the assessment, a two-stage method is developed to first recognize the speaking content based on non-native speech acoustic properties and then forced-align the recognition results with a reference acoustic model reflecting n...

Publication date: 2015